Confusion Graph: Detecting Confusion Communities in Large Scale Image Classification
نویسندگان
چکیده
For deep CNN-based image classification models, we observe that confusions between classes with high visual similarity are much stronger than those where classes are visually dissimilar. With these unbalanced confusions, classes can be organized in communities, which is similar to cliques of people in the social network. Based on this, we propose a graph-based tool named “confusion graph” to quantify these confusions and further reveal the community structure inside the database. With this community structure, we can diagnose the model’s weaknesses and improve the classification accuracy using specialized expert sub-nets, which is comparable to other state-of-the-art techniques. Utilizing this community information, we can also employ pre-trained models to automatically identify mislabeled images in the large scale database. With our method, researchers just need to manually check approximate 3% of the ILSVRC2012 classification database to locate almost all mislabeled samples.
منابع مشابه
Visual Concept Learning: Combining Machine Vision and Bayesian Generalization on Concept Hierarchies Supplementary Materials
In the supplementary materials, we present more details to the data collection procedure, and more details about the training of our image classification component, including the large-scale classifier training and the confusion matrix estimation.
متن کاملExploiting confusion matrices for automatic generation of topic hierarchies and scaling up multi-way classifiers
A common way to evaluate a multi-way classifier is a confusion matrix that plots, for each of the learned concepts, the true class of test instances against the predicted classes. Aggregate accuracy figures of the classifier are obtained by summing up the diagonal entries of the confusion matrix. However, invaluable information about the relationships amongst classes is often ignored. In this r...
متن کاملFar-IR Detection Limits I: Sky Confusion Due to Galactic Cirrus
Fluctuations in the brightness of the background radiation can lead to confusion with real point sources. Such background emission confusion will be important for infrared observations with relatively large beam sizes since the amount of fluctuation tends to increase with angular scale. In order to quantitively assess the effect of the background emission on the detection of point sources for c...
متن کاملThe Potential Use of Very High Spatial Resolution Data and Object-based Classification for Mapping Urban Sprawl
We present an image analysis flowchart, based on the use of object-oriented software (eCognition). The maps obtained identify the urban landscape at different scale levels (urban areas and buildings – settlements) and at different organization levels (dense, discontinuous or low-density urban landscape). A particular effort has been made to assess the quality of the results (confusion matrix, K...
متن کاملA Machine Learning Based Framework for Verification and Validation of Massive Scale Image Data
Big data validation and system verification are crucial for ensuring the quality of big data applications. However, a rigorous technique for such tasks is yet to emerge. During the past decade, we have developed a big data system called CMA for investigating the classification of biological cells based on cell morphology that is captured in diffraction images. CMA includes a group of scientific...
متن کامل